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Abstract 

Many networks contain correlations and often conventional analysis 
is incapable of incorporating this often essential feature. In arXiv:070 8.2176| 
we introduced the link-space formalism for analysing degree-degree cor- 
relations in evolving networks. In this extended version, we provide 
additional mathematical details and supplementary material. 

We explore some of the common oversights when these correlations 
are not taken into account, highlighting the importance of the formal- 
ism. The formalism is based on a statistical description of the fraction 
of links connecting nodes of degrees i and j. To demonstrate its 
use, we apply the framework to some pedagogical network models, 
namely, random- attachment, Barabasi- Albert preferential attachment 
and the classical Erdos and Renyi random graph. For these three 
models the link-space matrix can be solved analytically. We apply the 
formalism to a simple one-parameter growing network model whose 
numerical solution exemplifies the effect of degree-degree correlations 
for the resulting degree distribution. We also employ the formalism to 
derive the degree distributions of two very simple network decay mod- 
els, more specifically, that of random link deletion and random node 
deletion. The formalism allows detailed analysis of the correlations 
within networks and we also employ it to derive the form of a perfectly 
non-assortative network for arbitrary degree distribution. 
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1 Introduction 



Networks - in particular large networks with many nodes and links - are 
attracting widespread attention. The classic reviews [H [20l [35] with their 
primary focus on structural properties have been followed up by more recent 
ones addressing the role of dynamics, such as spreading and synchronisation 
processes on networks, as well as the role of weights and mesoscopic struc- 
tures, i.e. cliques (fully connected subgraphs) and communities (groups of 
densely interconnected nodes), within networks [10^ I36j. 

Although several different measures for characterising networks have 
been presented, for example in a recent survey [19], the simple concept 
of vertex degree remains unrivalled in its ability to capture fundamental 
network properties. When comparing the degrees of connected vertices, 
however, one often finds that they are correlated, a quality that gives rise 
to a rich set of phenomena [181 l30] 134] . Degree correlations constitute a 
central role in network characterisation and modelling but, in addition to 
being important in their own right, also have substantial consequences for 
dynamical processes unfolding on networks. Given the increasing current 
interest in network dynamics, understanding structural correlations remains 
important and timely. 

In this paper we provide a detailed mathematical formalism for mod- 
elling networks with correlations. It is built around a statistical descrip- 
tion of inter-node linkages as opposed to single-node degrees. Most works 
devoted to analytical calculations of correlations in models have been per- 
formed only for particular cases [6]. We provide a brief overview of degree 
correlations for network structure and dynamics in Section [2] and discuss 
how the work presented relates to existing studies. Section [3] contains the 
notation used throughout the rest of the paper. We highlight the impor- 
tance of acknowledging these correlations in Section SI through reviewing a 
model of network growth proposed by Saramaki and Kaski [501 (and sub- 
sequently by Evans and Saramaki [23]) which employs random walkers to 
grow a network. The link-space formalism, which lies at the core of the pa- 
per, is introduced in Section El The formalism comprises a master equation 
description of the evolution of a very specific matrix construction which 
we term the link-space matrix. The formalism can be implemented in a 
number of ways. To demonstrate its use, we apply it to two well-known, 
non-equilibrium examples, namely random-attachment and Barabasi- Albert 
(BA) preferential-attachment networks [UIH] in Section [5] and solve the so- 
called link-space matrix analytically for these models. In Section El we also 
apply it to the classical equilibrium random graph of Erdos and Renyi [22] 
(ER) using some of the ideas discussed in [H] which, interestingly, requires 
a full, time-dependent solution of the link-space master equations. The 
cumulative link-space introduced in Section El aids comparison between sim- 
ulated and analytic link-space matrices and could, in principle, be applied 
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to empirical networks to better ascertain their degree correlations. Analytic 
derivation of link-space matrices allows detailed analysis of the degree-degree 
correlations present within these networks as is demonstrated in Section [6l 
Within Section [6l we also show how, for arbitrary degree distribution, a 
link-space matrix can be derived which has no correlations present, i.e. the 
link-space is representative of a perfectly non-assortative network with that 
degree distribution. We then consider the counter-intuitive prospect of find- 
ing steady states of decaying networks in Section [71 In Section El we discuss 
the Growing Network with Redirection (GNR) model of Krapivsky and Red- 
ner [30]. In Section El we introduce a simple one-parameter network growth 
algorithm with a similar redirection process which is able to produce a wide 
range of degree distributions. The model is interesting in its own right in 
the sense that it makes use of only local information about node degrees. 
Here the link-space formalism allows us to identify the transition point at 
higher exponent degree distibutions switch to lower exponent distributions 
with respect to the BA model. We conclude in Section [lOl 

2 Overview of degree correlations 

A discussion of correlations might naturally start with the phenomenon of 
clustering. The highly influential paper of Barabasi and Albert and many 
subsequent papers focussed on the modelling and analysis of models which 
replicate power-law degree distributions observed in many real-world sys- 
tems [HEIIES]- However, the BA preferential attachment mechanism lacks 
certain key features observed in many of those systems, one such feature be- 
ing clustering. Clustering is an important local statistic in many networks 
and reflects the connectedness of neighbours of a node. In a social context, if 
person Y is friends with both X and Z, one might expect some kind of link 
between X and Z. The measurement of the connectedness of neighbours 
of nodes is the clustering coefficient. This is often averaged over all nodes 
and a single value is given as the clustering coefficient of the network [20] . 
An explicit example is given in Figure [H The node labelled i has ki neigh- 
bours (in this case 5). Between these there could be a possible ki{ki — l)/2 
undirected links of which y exist. For this example, 3 links exist out of the 
possible 10 between the neighbours of node i. The clustering of node i is 
given by 



and would be 0.3 for our example. The clustering nature of a network can 
also be expressed as the average over all nodes of degree k giving a clustering 
distribution (or spectrum) , C{k). 

As noted by Klemm and Egufluz, in a BA network, the average clustering 
of a given node is independent of its degree [28], in contrast to the findings 
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Figure 1: Node i has 5 neighbours which could potentially share 10 links 
between them. Only three out of these 10 possible links exist between the 
neighbours of i. Its clustering coefficient is then expressed Cj = 0.3. 



of Pronczak et al. [26], and tends to zero in the large-size (thermodynamic) 
limit. A network model which does exhibit high clustering is that of Watts 
and Strogatz |47j in which a random rewiring process is carried out on an 
initially regular lattice. The networks generated by such a process feature 
short average shortest path lengths between node pairs (the small-world 
effect). However, these networks do not exhibit a power-law degree distri- 
bution. Subsequently, there have been many efforts to build models which 
can encompass both of these features such as the non-equlibrium growing 
network model proposed by Klemm and Egufluz which features deactivation 
of a nodes availibility to be connected to [28]. Other models such as that 
of Holme and Kim [27] and that introduced by Toivonen et al. [H] mod- 
ified the original BA preferential attachment mechanism, allowing further 
links between the new node and neighbours of the preferentially selected 
node. The social network model of Toivonen et al. produced communities 
with dense internal connections. Szabo et al. formulated a scaling assump- 
tion and a mean-field theory of clustering in growing scale-free networks 
and applied it to the Holme and Kim mechanism [33]. As discussed by 
Boguha and Pastor-Satorras, clustering in networks is closely related to de- 
gree correlations [12] . Infact, based on the work of Szabo et al., Barrat and 
Pastor-Satorras introduced a framework for computing the rate equation 
for two vertex correlations in the continuous degree and continuous time 
approximation [B]. We shall now describe these degree correlations. 

Vertex degree correlations are measures of the statistical dependence of 
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the degrees of neighbouring vertices in a network. In general, n- vertex degree 
correlations, or n point correlations, can be fully characterised by the con- 
ditional probability distribution P{ki, k2, . . . , kn\k = n) that a vertex of de- 
gree k = nis connected to a set of n vertices with degrees ki, k2, ■ ■ ■ , kn- Two 
and three point correlations are of particular interest in complex networks 
as they can be related to network assortativity and clustering respectively. 
More specifically, two vertex degree correlations (two point correlations) can 
be expressed as conditional probability P{k'\k) that a vertex of degree k is 
connected to a vertex of degree k' . Similarly, three vertex degree correla- 
tions (three point correlations) can be fully characterised by the conditional 
probability distribution P{k' , k"\k) that a vertex of degree k is connected to 
both a vertex of degree k' and a vertex of degree k" . This implies that the 
degrees of neighbouring nodes are not statistically independent. Reliable 
estimation of P{k'\k) and P{k' ,k"\k) requires a large amount of data and, 
in practice, one often resorts to related measures. Instead of P{k'\k), the 
average degree of nearest neighbours, {knn)k, is often measured. This can be 
formally related to P{k'\k) [12l[38]. If {knn)k increases with k, high degree 
vertices tend to connect to high degree vertices. A network with this prop- 
erty would be described as being assortative or displaying positive degree 
correlations. If {knn)k decreases with A;, high degree vertices tend to connect 
to low degree vertices ( disassortative or negatively correlated) [12^ l3^ . An 
alternative to {knn)k is to use a normalised Pearson's correlation coefficient 
of adjacent vertex degrees providing a single number measure of assorta- 
tivity as suggested by Newman to further classify networks [341 [35] . These 
approaches are discussed in more detail in Section [6l To characterise three 
point correlations, instead of using P{k' , k"\k)^ one can employ the clustering 
spectrum C{k), which can be related to P{k' , k"\k) [12]. In many real world 
networks such as the Internet [38], the clustering spectrum is a decreasing 
function of degree and while this is sometimes interpreted as a signature of 
hierarchical structure in a network, Soffer and Vazquez suggested that this 
is a consequence of degree-degree (two point) correlations that enter the 
definition of the standard clustering coefficient [32] . The authors introduced 
a different definition for the clustering coefficient that does not have the 
degree-correlation 'bias', i.e. a three-point correlation measure that filters 
out two-point correlations. Following the suggestion of Maslov et al. [33] 
that these phenomenon might arise from topological constraints rather than 
evolutionary mechanisms. Park and Newman demonstrated that dissasorta- 
tive degree correlations observed in the Internet could be explained via the 
restriction of there being no double edges between nodes [37]. In contrast, 
social networks have been found to be assortative [3^. Similarly, Catazaro 
et al. observed that the network of scientific collaborations was assortative 
and presented a model to reproduce this feature [16] . 

Functional processes occurring on networks are influenced by degree cor- 
relations, highlighting the importance of their role in complex networks. 
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Eguiluz and Klemm considered highly clustered scale-free networks and 
showed that correlations play an important role in epidemic spreading [23j . 
The time average of the fraction of infected individuals in the steady-state 
undergoes a phase transition at a finite critical infection probability. They 
related this critical threshold to the transmission probability and the mean 
degree of nearest neighbours of all nodes in the system {knn), the conjec- 
tured criterion for epidemic spreading being related to the product of the 
two. The value {knn) scales with system size in their highly clustered scale- 
free network slower than in the random scale-free which is a byproduct of 
the dissasortativity of the system. Consequently, whereas in random scale- 
free networks in which viruses with extremely low spreading probabilities can 
prevail, the absence of connections between highly connected nodes in highly 
clustered scale- free networks protects the system against epidemics |23j . In- 
terestingly, Bogiina et al. assert that any scale-free network of appropriate 
exponent will have diverging {knn) in the thermodynamic (infinite size) limit, 
resulting in no threshold properties for epidemic spreading regardless of the 
correlations within the network [13] in contrast to the earlier suggestion of 
Bogiina and Pastor- Satorr as [TT]. Brede and Sinha induced correlations to 
Erdos-Renyi and scale-free networks by rewiring them appropriately to ex- 
amine their dynamic stability [TS]. Each node had an associated variable 
whos evolution was governed by some non-linear function of the variables 
associated with neighbouring nodes. They mapped the adjacency matrices 
into Jacobian matrices and examined the largest eigenvectors, reflecting the 
decay rates of perturbations of the nodes' variables about their equilibrium 
states. They found that positive correlations within the network reduced the 
dynamical stability [15j. Similarly, Bernardo et al. induced negative degree 
correlations in scale- free networks whose links couple non-linear oscillators. 
Through analysis of the eigenratio (ratio of the eigenvalues) of the Lapla- 
cian of such networks, they found that network synchronisability improved 
as the network was made more dissassortative [9]. Based on this result, they 
conjectured that negative degree correlations may emerge spontaneously as 
the networked system attempts to become more stable [8j. The same au- 
thors found similar results to hold also for weighted networks f9]. Maslov 
and Sneppen found that in regulatory networks, links between highly con- 
nected proteins were relatively rare increasing the robustness of the network 
to perturbations through "localising deletous perturbations" [32j. This is 
consistent with the findings of Berg et al. whose model of the evolution of 
protein interaction networks exhibited disassortativity consistent with their 
empirical findings [7]. 

Krapivsky et al. used a master equation method, in which rate equations 
for the densities of nodes of a given degree are employed, to investigate the 
steady state of the BA preferential attachment mechanism [29J. This is a 
general method which can be applied to various growing network models. 
To extend the method to encompass two-point correlations, Krapivsky and 
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Redner applied the approach to the number A^^^^^ of nodes with total degree 
k connected to ancestor nodes of total degree / in a directed network |30j . 
They solved analytically the master equations for the steady-state of a spe- 
cific directed network growth model, namely the Growing with Redirection 
algorithm [30] which is further discussed in Section [8] and is similar to the 
mixture model introduced in Section [9l Boguha and Pastor- Satorr as consid- 
ered a link orientated description of two point correlations. For undirected 
graphs, they introduced a symmetric matrix whose elements E^^i represent 
the number of links connecting nodes of degree k to nodes of degree I jl2| . 
Boguha and Pastor-Satorras related this matrix to the joint degree distri- 
bution P{k' I k) but noted that finite size effects make empirical evaluation 
of the matrix difficult, suggesting the use of the spectrum {knn)k as a more 
suitable observable. Here, we introduce a similar matrix construction which 
can be applied to both the undirected and directed scenarios. We call this 
the link-space matrix. We show that it is possible to construct master equa- 
tions to model the evolution of this matrix which can be applied to a wide 
variety of network evolution algorithms retaining important degree correla- 
tions which can be critical to the network's development. This framework 
is termed the link-space formalism and is introduced in Section [5l In cer- 
tain cases, these master equations can be solved analytically providing a full 
time-dependent solution or a steady-state solution of the link-space matrix 
of the network. From these analytically derived link-space matrices both the 
degree and joint degree distributions can be obtained allowing accurate anal- 
ysis of degree correlations. The formalism also allows the derivation of the 
form of networks of predetermined correlation properties such as a perfectly 
non-assortative network (Section [6]) and the derivation of the steady-state 
solutions to network decay algorithms (Section [7]). The formalism can also 
be employed in its iterative guise to provide approximations to the steady- 
state when an analytic solution is not possible such as for the mixture model 
of Section [H 

One and two-point correlations have natural physical counterparts within 
the network, specifically the nodes and links. The one-point correlations 
P{k) are simply related to the fraction of nodes in the network with degree 
k. As described in detail within this paper, two-point correlations P{k' \ k) 
are related to the fraction of links within a network connecting nodes of 
degree k with nodes of degree k' , hence the term link-space. One can extend 
the analysis to three-point correlations, P{k' , k" \ k). This would be related 
to the number (fraction) of pairs of links within a network sharing a com- 
mon node of degree k, the remaining link ends being connected to nodes of 
degrees k' and k" and all possible 'open triangles' within the network would 
have to be considered. Clearly, the process can be extended to arbitrary 
n-point correlations although the physical interpretation of the appropriate 
measurable quantities will become increasingly obscure. 
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3 Notation 

Throughout the subsequent discussions, we shall use the following notation: 

A'^ = the number of nodes (vertices) within a network, 

M = the number of links (edges) within a network, 

Xi = the number of nodes of degree i within a network, 

Ci = the fraction of nodes of degree i within a network 

AT' 

Pi = the probability of selecting an individual node in a network which has degree i, 

Pi = the probability of selecting any node in a network with degree i, 

= XiPi if selecting at random, 

Lij = the number of links from nodes of degree i to nodes of degree j for i ^ j, 

twice the number of links between nodes of degree i in the undirected graph, 
the number of links between nodes of degree i in the directed graph. 



Lii 



M ' 

{knn)i = average degree of the neighbours of nodes of degree i, 

{- — )i = average of 1/degree of the neighbours of nodes of degree i, 

Z Ci rZyin 

Qi = the probability of attachment to a node of degree i in the existing network, 
m = the number of links added per new node in a growing network. 

(2) 



4 Random walkers to generate networks 

The use of random walkers to generate networks was introduced by Saramaki 
and Kaski [40J. In their model, as in many non-equilibrium evolving net- 
works, one new node is attached to the existing network at each timestep 
with ra undirected links. The simplest implementation of their growth algo- 
rithm attaches this new node to the existing network component with m = 2 
links as follows: 



1. Select a node at random within the existing network. 

2. Perform a two-step random walk. 

3. Establish (undirected) links between the nodes arrived at within the 
existing network and the new node. 
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Figure 2: The degree distribution of an example network using the 
Saramaki/Kaski algorithm grown to 10^ nodes. 



Note that a link is not established between the new node and the initial node 
of the random walk. The generalisation is clear for m > 2 links per new 
node attached. To summarize, a random walk of length m steps is performed 
within the existing network from a randomly chosen start point, and a link 
between the new node and each of the nodes reached in the random walk is 
formed with the exception of the start point. 

Certainly this yields results comparable to the scale-free preferential at- 
tachment of the BA model as shown in Figure [2j However, the analytic 
approximations derived by Saramaki and Kaski in reference [30] do not hold 
in all cases. Their analysis is as follows. Consider choosing some vertex 
labeled A initially (at random) within the existing network comprising N 
nodes. This occurs with probability: 

PiA) = ^. (3) 

We now wish to consider the probability of moving to a neighbour vertex 
labeled B after one step of the random walk. Using Bayes theorem, 

,,,, . ^-1^, 

where p'{B) denotes arriving at node B after one step, p{B \ A) denotes 
the probability of arriving at at B if node A is the initial vertex chosen and 
p{A I B) is the conditional probability that, given we have arrived at B, we 
originated at A. The value p{B \ A) can be trivially written 

p{B\A) = ^, (5) 
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Figure 3: A trivial example network. 



where kA is the degree of vertex A. So far so good. However, the next 
justification is somewhat misunderstood. It is often thought [30] that the 
conditional probability p{A \ B) can be written 

P(A\B) = i. (6, 
which combined with the above yields: 

'''(^' = mi- « 

This compares to the the BA preferential attachment probability for attach- 
ing to a particular node of degree i: 

such that the probability of attaching to a node is proportional to its degree. 

To understand the problems with this analysis, consider a real system 
of nodes, as described in Figured We can write the probability associated 
with arriving at node B after a random walk of one step in terms of the 
initial node selected being one of the neighbours of B and the subsequent 
probability of moving to node B: 

p{B) = p{A)p{A ^ B) + p{C)p{C ^ B) + p{D)p{D ^ B) + p{E)p{E ^ B) 

N^kA ^ kc ^ ko ^ kE'' 



25 
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(9) 
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Clearly this is not the same as the earlier expression ([7]). Nor should it be. 
The earlier expression would yield different results depending on which node 
was labeled A which is clearly a nonsense. 

Another interesting assumption often made is that because the initial 
node is chosen at random, its vertex degree will be distributed similarly to 
that of the entire network [lO]. If this is so, ^ can be written in terms 
of arriving at a particular of node of degree i after randomly choosing an 
initial vertex and moving at random to one of its neighbours: 

where {k) is the mean degree of the existing network component. Even aver- 
aging over all nodes of degree i, and (incorrectly) assuming that on average 
the neighbours of these nodes are distributed identically to the network as 
a whole, this would have implied the entirely different result 



Xi 



We will discuss a similar expression in Section [9] and the mean degree of 
nearest neighbours in Section [6l 

In fact, the inherent degree-degree correlations are such that they can- 
not be ignored. We can see this by looking at an example network grown to 
A'^ = 10^ nodes using the simplest form (m = 2) of the Saramaki algorithm 
outlined earlier. Having generated such a network, we shall investigate the 
probability of arriving at a node of degree i after a random walk of one 
and two steps and compare these to the preferential attachment probability. 
Transforming the adjacency matrix of the network generated by the algo- 
rithm into a Markov-Chain transition matrix [2] (dividing the columns by 
their sums) we can look at all possible random walks from all possible start- 
ing points for a number of steps. This is simply acheived by premultiplying 
a vector with all elements by the transition matrix. The result is a 
vector containing the probabilities of arriving at each node after a random 
walk of one step from random initial node. Multiplying this vector by the 
transition matrix again, provides the probabilities of arriving at each node 
in the network after a two-step random walk. Knowing the degrees of all the 
nodes in the network, it is then straight-forward to look at the probability 
of arriving at a degree i node after a one or two step random walk. Whilst 
the degree distribution is described by q which is the same as Pj = ^ 
for selecting nodes at random, the BA preferential attachment probability 
is described in terms of individual node probabilities pi. We use the first 
two parts of (llip to infer the probability of arriving at a specific node of 
degree i after one or two steps. The preferential attachment probability 
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Figure 4: The probability of arriving at a specific node of degree i after a 
random walk of one and two steps compared to the preferential attachment 
probability. This analysis has been carried out on a network comprising 10^ 
nodes grown using the Saramaki and Kaski algorithm. 



would correspond to an infinite random walk. The comparison is illustrated 
in Figure HI Clearly, a one-step random walk is biased towards high degree 
nodes in comparison to preferential attachment whilst a two step random 
walk is biased towards low degree nodes. 

To understand why this model yields the scale-free degree distributions 
equivalent to preferential attachment, we must look at the expected degree 
of the nodes in the existing network to which the new node is to be attached. 
Using the same transition matrix process, the mean degree of nodes reached 
after a number of steps of random walk can be calculated for a given net- 
work (here 10^ nodes) for all possible random walks from random starting 
positions. The expected degree of the first node in the walk is much higher 
than the convergent (preferential attachment) value, and the second lower as 
shown in Figure [5l By interpolating between these two, an approach to pref- 
erential attachment has been achieved for this network growth algorithm, 
even for m = 2 links added (2 step random walk) per new node. 

We will make use of these correlations in our proposed model (see Sec- 
tion [9]). 

5 Node-space and Link-space 

We now introduce the link-space formalism which is based upon the master 
equation method. Consider a simple, growing, non-equilibrium network in 
which one node is added to a network at each timestep and this node is 



13 



25 r 



Preferential Attachment 

* Random Walk 



20 



T# ^ ?f 5? * If 9^ * * * * * * 




5 











5 



10 

random walk length 



15 



20 



Figure 5: The expected degree of the node arrived at after a number of 
steps in a random walk from random initial node. This is performed on a 
realisation of the Saramaki-Kaski network, m = 2 for 10^ nodes. 



connected to the existing network with exactly m undirected links. The 
process is governed by an attachment probability kernel Qj, defined as the 
probability that a specific, newly-introduced link attaches to any node of 
degree j within the existing network. At some time t there exist Xi{t) 
nodes of degree i and we wish to compute the expected number of nodes 
with degree i at time t+1. The node-space master equations can be expressed 
in terms of the attachment kernels and are written 



since a new node of degree m is added to the existing network at each 
timestep and there are no nodes with degree less than m. 

So far we have said nothing about the attachment mechanism, and have 
made the easily geneneralisable restriction that only one node is being added 
per timestep with undirected links. We now follow a similar analysis, but 
retain the node-node linkage correlations that are inherent in many real- 
world systems \n\ [30l [35j . Consider any link in a general network - we can 
describe it by the degrees of the two nodes that it connects. Hence we can 
construct a matrix L(i) such that the element Lij{t) is equal to the number 
of links from nodes of degree i to nodes of degree j for i ^ j at some time 
t. To ease the mathematical analysis below, the diagonal element Li^i{t) is 
defined to be twice the number of links between nodes of degree i for the 
undirected graph, a mathematical convienience also observed by Boguha 
and Pastor-Satorras [12]. For undirected networks L(t) is symmetric and 



{Xm{t + 1)) 



Xi{t) + mQi-i{t) - r 
Xm{t) + 1 - mQm{t) 



mQi{t) i > m, 



(12) 
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j Lij{t) = 2M{t), twice the total number of links M{t) in the network 
which is simply mt when introducing m links per timestep. An explicit 
example is given for the simple network discussed in Section H] which is 
shown in Figure [3l This network will also be further analysed in Section [9l 
The link-space matrix for this network is then expressed as in p3p . Note 
the double counting of the 4 <-> 4 link (B ^ C). 



/ 1 2 4 \ 

10 1 

2 1 

\ 4 1 1 2 / 



(13) 



The matrix, L, represents a surface describing degree-degree correlations 
and is called the link-space matrix. 

Consider one of the newly introduced links in an evolving network, one 
end of which is attached to the new node. The probability of selecting any 
node of degree i — 1 within the existing network for the other end to attach 
to is given by the attachment probability Gi_i(t). Suppose an i — 1 node is 
selected. The fraction of nodes of degree i — 1 that are connected to nodes 
of degree j is 

Li-ij{t) 
{i-l)Xi^i{t)- 

The expected increase in links from nodes of degree i to nodes of degree j, 
through the attachment of the new node to a node of degree i — 1, is given 
by 

ei-i{t)Li_ij{t) 

Xi-i{t) 

Since each link has two ends, the value Ljj- can increase by a connection 
to an [i — l)-degree node which is in turn connected to a j-degree node, 
or by connection to an (j — l)-degree node which is in turn connected to 
an i-degree node. We write master equations governing the evolution of 
the link-space matrix as the evolution of the expected number of links from 
i to j degree nodes, i.e. the number of i j links. This is the link-space 
formalism and is written for the (generalisable) case of adding one new node 
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with m undirected links to the existing network as 



+ 



{L^j{t + l)) = L^j{t)+mej-i{t) + 



mQj^i{t)Lij^i{t) _ mQi{t)Lij{t) 
Xj{t) ' ^'-^ 

mQj_i{t)Lm,j~.i{t) 



mQrn{t)Lm,j{t) 
Xm{t) 

mQj{t)Lm,jit) 



x,it) 



j > m. (14) 



There are a variety of ways in which both the node-space and hnk-space 
master equations can be approached. For example, a full, time-dependent 
solution could be investigated as in Section [5T3] or. using appropriate initial 
conditions, the equations can be iterated over the required timescale as 
in Section [9j We can also investigate the possibility of a steady state of 
the algorithm under scrutiny. To do so, we assume that there exists a 
steady state in which the degree distribution remains staticQ and investigate 
a solution. Under this assumption, the fraction of nodes Ci{t) = Xi{t)/N{t) 
which have a given degree remains constant such that {Xi{t + 1)) — Xi[t) ~ 
dXi/dt = Ci when one new node is added per timestep {N{t) = t). It is also 
assumed that in the steady state, the attachment kernels are static too. We 
drop the notation '(t)' to indicate the steady-state and can rewrite p2]) as 

Ci = m0j„i — mQi, i > m, 

Cm = I- m@rn- (15) 

The fraction of links between nodes of degree i and nodes of degree j can be 
expressed as the normalised link-space matrix, lij{t) = Lij{t)/M{t), which 
sums to 2 in the undirected case. In the steady state we can assume that 
these values are static and can rewrite the link-space master equation (I14p 



^The master equation does not represent the evolution of an ensemble average of net- 
works. Each specific realisation will have its own evolution of attachment kernel which 
cannot be described by the ensemble average. The existence of a steady state to the mas- 
ter equation does not necessarily imply that specific realisations converge upon it in the 
large A'^ limit but simply that the solution is static with respect to the growth algorithm. 
In practice, however, a master equation approach often yields good analytic agreement 
with even single realisations. 
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as 

kj = : z, e" ' hJ>rn, 

m Ci Cj 

'tn Cm Cj 

The notation '(f)' has again been dropped to indicate the steady-state and 
here the generahsable situation of adding one new node per timestep is 
considered. 

To apply the hnk-space formahsm one starts by specifying the model 
dependent attachment kernel 0,. To investigate a time-dependent solution, 
the attachment kernel is substituted into (|12p and (jl4p . To investigate a 
steady-state solution, the kernel is substituted into the ([13]) and to 
yield recurrence relations for Cj and lij respectively which can be solved 
analytically in some cases. The number of z-degree nodes is given by Xi(t) = 

X^/fc which allows us to retrieve the degree distribution from the 
normalised link-space matrix: 

^'^ ^ ~ iN{t) ■ ^ '> 

Degree distributions of empirically observed or simulated networks typi- 
cally become dominated by noise at large degrees reflecting the small prob- 
abilities associated with these values occurring. In the link-space, the situa- 
tion is exacerbated and the high i,j limit reflects connections between these 
high degree nodes. This is, of course, rarer than the existence of nodes of ei- 
ther degree. Following the conventional approach which is applied to degree 
distributions [20] , we can use a cumulative representation of the link-space 
to address this issue. The use of a cumulative binning technique averages 
over stochasticity in the system. With degree distributions, regression tech- 
niques (curve fitting) applied to the cumulative distribution is used to obtain 
a more accurate description of the actual degree distribution than would be 
obtained from fitting to the empirical distribution itself [20]. The process 
can be similarly applied in the link-space. Surface fitting could be applied 
to the cumulative link-space obtained from an empirical network. From 
the cumulative fitted surface, a more accurate representation of the actual 
link-space could be obtained for the empirical network, from which a better 
representation of the network's correlations could be obtained. This pro- 
cess also allows for comparison between simulated and analytically-derived 
link-space matrices. We define the cumulative link-space matrix, cumhj to 
be 

oo oo 

cum^ij — ^ ^ ^ ^ ^XtV (^^) 
x=i y=j 
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Note that we have not lost generahty in that given a cumulative linkspace 
matrix, the actual link-space can be derived using variations on the follow- 
ing: 

— cumJ'iyj cum cum ~\~cum ^i+lj+l- (^9) 

The computation involved when evaluating cumh,j can be cut down consid- 
erably by first evaluating cumh,i, which in the undirected graph is equal to 
2. The leftmost column (or top row) for i > 1 can then be evaluated as 



— cumh— 1,1 ^ ^ h—l,X' (^0) 



cumH,l 



Note that the second term on the right hand side is a row sum of the nor- 
malised link-space matrix and, hence, quickly evaluated. Indeed this row 
sum can be related to the degree distribution Cj_i from (fT7|) . Subsequent 
elements can be evaluated using the following simple identity, 

cum^ij — cum^i—l,j ~\~cum h,j — l 

cum^i—lj — l ~l~ ^i—l,j — l- {'^^) 

We will now demonstrate the use of the link-space formalism to study 
a random attachment model and the Barabasi-Albert (BA) model using 
steady-state solutions and the classical Erdos and Renyi random graph using 
a time-dependent solution. 



5.1 Random attachment model (steady-state solution). 

Consider first a random-attachment model in which at each timestep a new 
node is added to the network and connected to an existing node with uniform 
probability without any preference ('Model A' in [HE]) with one undirected 
link such that m = 1 and M(t) ~ N(t) = t. We assume a steady-state 
solution and the attachment kernel is 0j = Xi/t = Ci. Substituting into 
p5|) . we obtain the recurrence relation Cj+i = Ci/2, which yields the familiar 
degree distribution Cj = 2~\ Substituting into (fT6]) yields the recurrence 
relation 

k,j = {k-i,j + h,j-i)/3, i,j > 1, 

= (c,_i + /i,,_i)/3, i>l, (22) 

with li^i = 0. It is easy to populate the link-space matrix numerically, just 
from the recurrence equations (j22p and the degree distribution. At first 
glance, the solution to the master equation (I22p would be of the form 

hi = ., . . (23) 
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Figure 6: The paths of probabihty flux from c^_i influencing element Zjj. 
Each arrow (step) represents a further factor of ^. 



However, the boundary conditions (which could be interpreted as influx of 
probability into the diffusive matrix) are such that this doesn't hold. We 
can actually solve the link-space for this model exactly. Consider the values 
Cj as being influxes of probability into the top and left of the link-space 
matrix. We can compute the effects of such influx on an element in the 
normalised link-space matrix, li j. Each step in the path of probability flux 
reflects an extra factor of ^. First, we will consider the influx effect from the 
top of the matrix as in Figure El The total path length from influx Cx-\ to 
element /jj- is simply i + j — x. The number of possible paths between Cx-\ 
and element /jj can be expressed ("'"Jl'^*"^) using the conventional binomial 
combinatorial or 'choose' coefficient. We can similarly write down the paths 
and lengths for influxes into the left hand side of the matrix. The first row 
(and column) can be described as 



3 



fc=l 



(24) 
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Figure 7: Comparison of the numerically derived normalised link-space ma- 
trix and the analytic solution for random attachment whereby one new node 
is added with one undirected link per timestep. The numerical implemen- 
tation populates the link-space matrix directly from the degree distribution 
and the link-space recurrence relations. The first twenty rows (i : 1 — > 20) 
of the link-space matrices are illustrated. 



Subsequent rows can be similarly described. So, for i,j>l an element can 
be written 

~ 3 

j /i-l+j~x\ 
J,(i+i-x)2{x~l) 

x=2 

i n—l+j—x\ 

E \ i—x I (')K\ 
'^(i+j—x)2{x—l) ' ^ ' 

x=2 

This expression is compared to a numerically populated link-space matrix in 
Figure [71 We shall make use of this solution to investigate the correlations of 
such a network in Section [6l This normalised link-space matrix is illustrated 
in Fig.[8]along with a comparison to simulated networks using the cumulative 
link-space matrix. 
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Figure 8: Left: the analytically-derived, normalised link-space matrix for 
the random attachment growth algorithm. This could be filled to arbitrary 
size but is truncated to maximum degree of 40 here. Right: comparison of 
the cumulative link-space matrices for the analytic solution and a simulation 
of the algorithm. The simulation comprises an ensemble average of 100 
networks grown to 10^ nodes. The maximum degree obtained was 37. The 
first 30 rows {i values) of the cumulative link-space are illustrated and finite- 
size effects are noticable at high j. 



5.2 Barabasi-Albert (BA) model (steady-state solution). 

In the BA model [ll[5], at each timestep a new node is added to the network 
and connected to m existing nodes with probabilities proportional to the 
degrees of those nodes, i.e. @i oc i yielding 

_ iXijt) _ iXijt) ^ iXijt) 

We consider the scenario when the new node is added with one undirected 
link, m = 1, and the attachment kernel is well approximated by @i ~ ici/2. 
Substituting this into (fTS]) yields the recurrence relation Cj = ^2^'~^ — 
j^Ci-i, whose solution is Cj = j^^r^jj^- This corresponds well to an actual 
network grown using this algorithm. Using the same substitution, the link- 
space master equations yield the recurrence relations: 

, _ - + 0' - ^)kj-i ■ • . T 

- 2 + i + j . *'J>^' 

= + ,>i. (27) 

Again, it is easy to populate the matrix numerically just by implementing 
the known degree distribution and the link-space recurrence equations but 
we can solve the link-space analytically. At first glance, the solution to the 
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master equation (f271l would be of the form 



w 

i{i + + 1) 



kj = -7--^^77--r.^ (28) 



where u; is a constant. This would imply for the degree distribution 

w 



i2(i + l)' 



(29) 



Summing over the entire node-space would give w = However, the 

boundary conditions (which could be interpreted as influx of probability 
into the diffusive matrix) are such that this solution doesn't hold. We can 
obtain an exact (although somewhat less pretty) solution by tracing fluxes 
of probability around the matrix and making use of the previously derived 
degree distribution. The steady-state link-space master equation for general 
attachment kernel is written 

1 + 2i 



e,-i 



-h,j~i + 0J-1 



1 +ei ^ o, 

Cl Cj 

Ki = 0. (30) 



We can rewrite pop in terms of vertical and horizontal components: 

h,j = '^i,j{h,j-i + Cj^i), 

h,i = 0, (31) 

where 





Ci- 


-1 
1 




1 + 


©1 

Ci 


+ 






0.7- 


-1 








■ 1 




1 + 


©1 


+ 


©1 



TT.. = . T (32) 
By considering the probability fluxes as shown in Figure [9l we can write 
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Figure 9: The components of flux of probability around the normalised 
link-space matrix. This is general to any attachment kernel. An element 
within the link-space matrix can be built up from contributing elements and 
the appropriate factors. 
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the individual elements in the link-space matrix as 

j / j 



y=2 ^ x=y ^ 

k,j = Z^ ( h-l,y^i,y j 
V=2 ^ x=v+l ^ 



x=y+l 

j 



^1,1 = 0. (33) 



Note that we have yet to introduce the attachment probability kernels and 
the analysis so far is general. Using the preferential attachment probability, 

e. - f , (34) 

we can write our component- wise factors for the master equation 

i + j + 2' 



i-1 

i + j + 2' 



Substituting (j35]) into (133]) and using the previously derived degree distri- 
bution yields for the first row 



4 -i-r X — 1 



.y(y-l)(y + l),J-4x + 3 
4(j-i)! x^, 

2(j + 6)(j-l) 
j(j + l)(i + 2)(j + 3)- 

Subsequent rows can be described: 



(36) 



_ (j-l)!(3 + z)! ^ (j-1)! (l+. + x)! 

" (2 + i+j)! ^2 ^2 + i + j)! (x-1)! 

(j-1)' f.o , -Nw , .^V-, (1 + i + x)! 



(2 + i + i)! 
Rewriting ([37|) gives 



|(3 + z)!/,i + (z-l)^/._i,.^i^±^±|^|. (37) 
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where the meaning of Ki and is transparent. Clearly, we can write 

§B for li^i,x, 

k-i,x = , . ,^\, {ki^i + Ei^ix\- (39) 
[1 + I + xy. [ 'J 

Substituting ([39]) into §7^ yields 

= (2+7+i)! {^- + (i-l)^^(AVi + (40) 

In order to solve this recurrence relation we define an operator for repeated 
summation, SJ^^ such that 



x=2 

j X„ ^n-l X4 X3 X2 



suf{x)) = E E E •■■EEE/(^)- (41) 

The subscripts denote the initial variable to be summed over and the final 
limit. A few examples of this operation clarify its use: 

SUfi^)) = fix). 
5,,,.(1) = 

5|,(1) = 5,, ,.(x) - 5,, ,(1) 

slxW = lif-j), 



6 

etc. 

2 



Sj,x{x) — ^ 1, 



SU^) = ^(/ + 3j2-4j), 

etc. (42) 

We can use this operator in our expression for the element lij in (I40p and 
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expand to the value -E2,x) 

+ (i _ + - l)(i _ 2)Sl^{Ki.2 + Ei_2,x) 





■ 1)1 


{2 + i 




u- 


1)! 


{2 + i 




u- 


1)! 



+ 

+{i - m - m - 3)K,_3S|,(1) + ... + - 2){i - 3) * ... * 2K2S^r^\l) 

+{i-l){i-2)*....*2S%\E2,.)\. (43) 



We can express S2,x in terms of the operator S too, 

E2,x = 45,.,425,,,(1) + . (44) 

The element lij can be expressed as 

(45) 

This form is somewhat obscure as the calculation of the operator values is 
less than obvious. However, we can transform to a more easily interpreted 
operator W{n) analogous to S but with different limits (which enables fur- 
ther analysis) such that 

x=l 

j Xn Xn-l X4 X3 X2 

KMi^)) = E E E •••EEE/(-)- (46) 

Xn = l Xn-l = l Xn-2 = l x-i = l X2 = l X = l 

Whilst, at first glance, it looks like little progress has been made, for this 
analysis we only need evaluate the repeated operation on initial function 
f{x) = 1. This is exactly solvable and as such, we can rewrite in terms of a 
function G{n) dropping the superfluous x subscripts, 

G,{n) = Wf{l), 

G,{n) = niu-iy. iorn>0 ^^^^ 
10 tor n < 0. 

The inductive proof associated with (1T7|) can be understood by path count- 
ing for some repeating binomial process and is an intrinsic property of the 
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Figure 10: A binomial process over seven steps. The number of paths 
between A and B can be expressed as the sum of the paths A — > W, 
A ^ X, A ^ Y and A ^ Z . 



combinatorial choose coefficient. Consider a repeated coin toss over x steps. 
The number of ways of achieving n + 1 successes after these x iterations is 
in+i) ■ l^ow, the occurrence of this last success could have happened on the 
{n + l)th iteration or the following one, or any of the subsequent iterations 
till the xth one. For the last successful outcome to occur on the mth step, n 
successful outcomes must have occurred in the previous steps. The number 
of ways this could have occurred is C""^) . Clearly, summing over all possible 
m values, the total possible paths resulting in n + 1 successes must equate 
to UJ. 

For clarity, this is depicted in Figure [TUl To reach point B from A in 
the binomial process, one of the steps w, x,y or z must be traversed, after 
which there is only one route to B. The number of paths between A and 
B utilising step w is the same as the number of paths between A and W. 
Similarly, the number of paths between A and B utilising step x is the 
same as the number of paths between A and X and so on. The number of 
paths between A and B can be expressed as the sum of the paths A W, 
A ^ X. A ^ Y and A — s- Z. We incorporate this behaviour into our proof 



27 



for the solution of Gj{n) 



G,{n + 1) = ^G,i 



j + n 
n 

3 

1 
j 



E 

1 

j + n 
n + 1 



J +n 
n 



(48) 



As G(l) = (j) = j, this inductive proof holds for all n and the following 
relations hold 

S^'il) = G{n) -G{n-1), 

S''{j) = G{n + 1) -G{n-1). (49) 

We can now write the element in our link-space matrix for preferential 
attachment exactly as our function G{n) is easily evaluated, 

= 2)1^' + 1) + 2G(0 - 3G(^ - 1) 

1 * 

+ - ^{k - l){k + 6)(G(i -k)-Gii-k-l) 



2 

k=l 



(50) 



and the first few rows of our link-space can be written 

_ 2(6 + i)(i-l) 

~ j(j + l)0- + 2)(j + 3)' ^''^ 

, ^ 2i(j-i)(i + io) + 48 

3j(j + l)(i + 2)(i + 3)(j + 4)- ^ ^ 

A comparison of the numerically populated link-space matrix and the an- 
alytic solution for the BA preferential attachment model with m = 1 is 
illustrated in Figure [HI We shall use this solution to investigate the corre- 
lations with this network in Section [6l This normalised link-space matrix is 
illustrated in Fig. [12] along with a comparison to simulated networks. 



5.3 Erdos and Renyi random graph (time-dependent solu- 
tion). 

The Erdos and Renyi (ER) classical random graph is the quintessential 
equilibrium network model |20l I22j . However, to employ the link-space for- 
malism, which tracks the evolution of a network's correlation properties. 
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analytic solution 
numerical implementation 




Figure 11: Comparison of the numerically derived normalised link-space 
matrix and the analytic solution for the steady state for preferential attach- 
ment whereby one new node is added with one new link. The first twenty 
rows (i : 1 — > 20) of the link-space matrices are illustrated. 




iog{j} 




Figure 12: Left: the analytically-derived, normalised link-space matrix for 
the preferential attachment growth algorithm. This could be filled to arbi- 
trary size but is here truncated to maximum degree of approximately 10^. 
Right: comparison of the cumulative link-space matrices for the analytic 
solution and a simulation of the algorithm. The simulation comprises an 
ensemble average of 1000 networks each grown to 10'' nodes. The maximum 
degree obtained was 17609. Various rows (i values) from 1 to 3500 of the 
cumulative link-space are illustrated and the effects of the finite nature of 
the simulation are apparent. 
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we must model it as an evolving, non-equilibrium network. The classical 
random graph can be modelled as a constantly accelerating network with 
random attachment. At each timestep, we add one new node to the existing 
network. All possible links between the new node and all existing nodes are 
considered and each is established with probability a. That is, a biased coin 
toss (Bernoulli trial) is employed for every node within the existing network 
to decide whether a link is formed between it and the new node. Conseqently 
the expected number of new, undirected links with which the new node con- 
nects to the existing network is {m{t)) = aN{t — 1) « at and this would be 
described as an accelerating network |41j . No new links are formed between 
existing nodes. This model subsequently produces a network in which the 
probability of a link existing between any pair of nodes is simply a and is 
thus representative of the ER random graph [22]. A network grown to t 
nodes will have mean degree at which will also be the expected degree of 
all nodes in the network. Whilst the random attachment and preferential 
attachment models both have steady-state assymptotic behaviour, clearly 
this model doesn't and a full, time-dependent solution is required. 

Using the random attachment probability kernel @i{t) = Xi(t)/N{t), 
the node-space master equation for the number of nodes of degree i can be 
written 

{X,[t + 1)) - X,{t} + 

+P{m{t) = i}. (53) 

The time-dependent solution of the degree distribution for this model is 
simply 

Ci{t) = (54) 

which is what we would expect for the random graph with mean degree of 
at in the large size limit [Hj. We can similarly write the link-space master 
equation for this model: 

(L,,(t + 1)) = L,,(t)+ ^"^^^)^Q-^(^)^-^-^- 



+ 



{m{t))ej-i{t)Lij^i _ {m{t))Qi{t)Li^j 
Xj^i Xi 



{m{t))ejit)Lij 



+ iej-iP{m{t) = i} 



+jGi_iP{m(t) = j}. (55) 

The last two terms reflect the expected number of new i ^ j links be- 
ing formed between the new node and existing nodes. Recalling that the 
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Figure 13: Left: the analytically-derived, normalised link-space matrix for 
the Erdos and Renyi (ER) random graph with mean degree equal to 5. 
This could be filled to arbitrary size but is here truncated to maximum 
degree of 30. Right: comparison of the cumulative link-space matrices for 
the analytic solution and a simulation of the algorithm. The simulation 
comprises an ensemble average of 500 networks each consisting of 10^ nodes. 
The probability that a link exists between any pair of nodes is a = 5 x 10"'^ 
and the maximum degree obtained was 25. The first 20 rows {i values) of 
the cumulative link-space are illustrated and, again, the finite nature of the 
simulation is apparent for high j. 



normalised link-space matrix is given by lij{t) = Lij{t)/M(t) where the 
number of links, M{t), at time t can be approximated as M{t) ~ at^/2 and 
making use of the previously derived, time-dependent degree distribution 
such that P{m{t) = j} = Cj{t), ((55]) can be rewritten as 



d ( at^lij{t)\ _ {atf 



dt \ 2 2 



I 2ijciit)cj{t) ^ ^^g^ 

The solution to ()56p is found by means of ansatz (the choice of which is 
explained in Section [6]) and is 

This is the normalised link-space matrix for a classical random graph with 
a mean degree of at in the large size limit and this is illustrated in Fig [13] 
where a comparison to a simulation is also made. 
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Figure 14: The mean degree of the nearest neighbours {k 

nn/i 3j function 

of i for both the preferential and random attachment algorithms and the 
calssical random graph described in Section [6.11 



6 Degree correlations and assortativity 

As discussed in Section [21 calculation of the mean degree of nearest neigh- 
bours of a node as a function of the degree of that node, {knn)i presents an 
indication as to the degree correlations of the network. This calculation is 
straightforward if one has the link-space matrix for a particular network, 



(58) 



Ej kj 

This is illustrated in Figure [TH If the average nearest neighbour degree 
{knn)i is constant, non-assortativity is implied. Certainly, the preferential 
attachment curve in Figured!] appears to asymptote to a constant value (as 
was noted in [23j). whilst that of random attachment continues to increase, 
implying positively assortative mixing. 

In search of a single number representation, Newman [3lj further stream- 
lined the measure by simply considering the correlation coefficient of the 
degrees of the nodes at each end of the links in a given network, although 
he subtracted one from each first to represent the remaining degree. Whilst 
this might be of interest in some situations, the correlation of the actual 
degrees is arguably a more general measure although the two do not differ 
much in practice. Using a normalised Pearson's [M] correlation coefficient, 
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the measure of assortativity is expressed in terms of the degrees {i and j) 
of the nodes at each end of a Hnk, and the averaging is performed over all 
links in the network. 



{ij) assort i"^) assort (j) assort 



The normalisation factor represents the correlation of a perfectly assortative 
network, i.e. one in which nodes of degree i are only connected to nodes of 
degree i. This would be of the same degree distribution as the network under 
consideration and has the same number of nodes and links. The physical 
interpretation is less than obvious especially as we are considering trees. 
However, for this assortative network, lij = ici5{i — j), i.e. all off-diagonal 
elements are zero. Whilst this assortativity measure can be expressed as 
summations over all links if calculated for a specific realisation of a network, 
we can express it in terms of the previously derived link-space correlation 
matrix, namely 



Eoo v^oo ijh,j _ ( v^oo j Cj 
i=l Z^7=l 2 V 2^7=1 2 



j=l 2 \^j=l 2 



2 



(60) 



It is interesting that the normalisation factor is degree-distribution specific. 
Evaluating this measure numerically, we obtain for the preferential attach- 
ment algorithm the value rpa ~ and for the random attachment algorithm, 
rra ~ 0.6. So according to this measure of degree mixing, the randomly 
grown graph is highly assortative and the preferential attachment graph is 
not. 

The link-space formalism allows us to address two-vertex correlations 
in a more powerful way in that we can calculate the conditional, two-point 
vertex degree distribution, P{j\i) which has been traditionally difficult to 
measure [H]. We can write this joint probability, i.e. the probability that a 
randomly chosen edge is connected to a node of degree j given that the other 
end is connected to a node of degree i, in terms of the link-space matrix (nor- 
malised or not) as P{j\i) = Lij/{iXi) = Uj/^JL^Lij = k,j / Y!fLik,j- 
For total number of edges aproximately equal to the total number of nodes, 
this can be approximated as lij/{ici). This is illustrated in Fig. [15] for the 
scenarios of random and preferential attachment. 

Consider selecting an edge at random in the network. If we select one 
end of the edge at random, the probability that this node has degree j will 
be proportional to j since higher degree nodes have, by definition, more 
links connected to them than low degree nodes. Consider now only a subset 
of all edges with one end attached to a node of degree i. If there were no 
correlations present, the probability that the other end is attached a node of 
degree j is again proportional to j. This is the criterion of non-assortativity 



33 



- non-assortative 




degree / 



degree j 



Figure 15: The conditional probability P{j \ i) that one end of a randomly 
selected link has degree j given that the other end is of degree i over the 
range i = {1, 6, 11, 16, 51}. The left plot is for the random attachment 
algorithm and the right is preferential attachment. The criterion of non- 
assortativity of (I6ip is denoted with red circles. 



and for total number of edges ~ total number of nodes can be expressed 

P{j\i) = P{j) 

= f • (61) 

This is also the criterion that a one step random walk replicates preferential 
attachment. 

6.1 Perfect non-assortativity 

It is interesting to ask whether a perfectly non-assortative network can be 
generated. Assuming that such a network will not have equal numbers of 
nodes and links, we must rewrite the conditions of non-assortativity accord- 
ingly for total number of edges M: 

P(;N)=PO-) = i| = ^. (82) 

Recalling our definition of the normalised link-space matrix that /j j = 
Lij/M, we can express this conditional probability P{j\i) in terms of this 
matrix as 

pm = ^ = ^- (63) 

As such, we can now write for /jj- 
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That is, for any degree distribution, a normalised link-space matrix can 
be found which is representative of a perfectly non-assortative network H. 
This might provide an alternative to the network randomisation technique of 
Maslov and Sneppen to provide a "null model network" ^32j. The expression 
of (I64p provides the ansatz solution for the normalised link-space matrix to 
the master equations for the ER random graph in Section [5.31 

7 Decaying networks 

Although somewhat counter-intuitive, it is possible to find steady states of 
networks whereby nodes and/or links are removed from the system. Aside 
from the obvious situation of having no nodes or edges left, we would like to 
investigate the possible existence of a network configuration whose link-space 
matrix and, subsequently, degree distribution are static with respect to the 
decay process. The concept that decay processes are highly influential on a 
network's structure has been considered before although this has typically 
only been investigated in conjunction with simultaneous growth [2511311139] . 
Here we shall employ the link-space formalism to examine the effect of some 
simple, decay-only scenarios, specifically the two simplest cases - random 
link removal and random node removal. 

7.1 Random Link Removal (RLR) 

Consider an arbitrary network. We select w links at random and remove 
them. We shall implement the link-space to analyse the evolution of the 
network. Consider the link-space element Lij{t) denoting the number of 
links from nodes of degree i to nodes of degree j. Clearly this can be 
decreased if an i j link is removed. Also, if a A; <-> i link is removed 
and that i node has further links to j degree nodes, then those that were 
i ^ j links will now be i — 1 <-> j, similarly for k j links being removed. 
However, if the link removed is a A; ^ j -|- 1 link and that j + 1 degree 
node is connected to a i degree node, then when the j + 1 node becomes 
an j node, that link will become an i ^ j link increasing Lij as shown 
in Figure [THl Let us assume that we are removing links at random from 
the network comprising N(t) vertices and M{t) links. A non-random link 
selection process could be incorporated into the master equations using a 
probability kernel. The master equation for this process can be written in 

network can be constructed for an arbitrary, normalised link-space matrix. A diago- 
nal element can be simply obtained through the addition of fully connected components 
of i -I- 1 nodes. Similarly an off-diagonal element kj can be generated through suitable 
rewiring of i-degree and j-degree cliques. 
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terms of the expected increasing and decreasing contributions: 




^ M{t)ij)X,it) 



^ Lk,i{t){i - l)Lij{t) 
^ M{t){i)X,{t) 



wLij{t) 
M{t) 



(65) 



This simpUfies to 




wiLi+ij{t) w{i - l)Lij{t) 



M{t) M{t) 
w{j - l)Lij{t) _ wLij{t) 
M{t) M{t) 



(66) 



This expression can be simply understood by considering Figure [TCI For 
each i <-> J + 1 hnk (labelled A in the figure), there are j links {B,C,D) 
whose removal would make link A an i j link. This would be the same for 
all of the z ^ j + 1 links, of which there are Ljj_|_i(t) in the network. The 
expected increase in Lij through this process corresponds to the second 
term on the right hand side of ()66p . The other terms are equally simply 
obtained, with the last referring to the physical removal of an i j link. 

To check that all is well, we can explicitly look at the evolution of the 
link-space matrix for the simple example network discussed in Section S] and 
shown again in Figure [TTl Prior to removing any links, the link-space can 
be simply written as in (i67|) . 



The expected link-space matrix after removing a link at random can be 
interpreted as the ensemble average of all possibilities. By removing a single 
link from the original network, the resulting link-space matrix for the new 
network can be easily generated. Replacing that link and performing this 



/ 1 2 4 \ 
10 1 
2 1 



(67) 



\ 4 1 1 2 / 



36 



Figure 16: Removal of any of the links B,C,D will result in the link A 
becoming an i <-> j link. 




Figure 17: A simple network which we shall examine to check the validity 
of the master equation (|66|) for the evolution of the link-space matrix for 
the random link removal process. 
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process for all links and simply averaging all the resulting matrices, we 
can generate (albeit somewhat laboriously) the desired expected link-space 
matrix for the network at the next timestep. This yields the result of (I68p 
which represents the average over all possible resulting networks after one 
link has been removed at random. This concurs with the master equation 
for this process given in (I66p . 



( 2 12 26 25 \ 

12 3 7 

26 3 6 10 

V 25 7 10 6 / 



(68) 



To investigate the possibility of a steady state solution, we make a similar 
argument as before for growing networks but this time for the process of 
removing w links per timestep. 

Lij{t) = lijM{t), 

= kj(Mo-wt), 



dt 



-wlij, 



« Lij{t + 1)- Lij{t). (69) 
The expression of ([66]) can be reduced to 

i + j-2 ^ ' 

The number of 1 1 links does not reach a steady state. This might be 
expected, as the removal process for such links requires them to be physically 
removed as opposed to the process by which the degree of the node at one 
end of the link is reduced. Subsequently, the value li^i would increase in 
time. However, we can investigate the properties of the rest of the links 
in the network which do reach a steady state by neglecting these two node 
components. In a similar manner to [3^ we can make use of a substitution 
to find a solution to this recurrence equation, namely for i -\- j ^ 2. 



(i-l)!(j-l)!' 



7. . - a- ■ ^ ■' f71) 



and we obtain 

This has a trivial solution 



+ Qiyj+l- (72) 



= (73) 
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log; 



Figure 18: The shape of the steady state link-space for random hnk removal 
from an infinite network numerically populated using (j74|) for i + j 7^ 2. 



The link-space for this system can then be written for z -|- j 7^ 2 

2^+i(i-l)!(i-l)!- ^ ' 

The summation over this link-space matrix does not converge, i.e. lij 
00. However, we can still derive the degree distribution and normalise this 
such that Yli Cj = 1 • We can use the form of (I74p to infer the shape of the 
degree distribution. Recalling that 

- ^ . , (75) 

we can write for the degree distribution, neglecting the two node components 
in the network, 

MA^ {k- 2)! 
~ "^^(fc-l)!2i+'=' ^ ' 



Ni {i - iy.{k - 1)12^+'= ^ ' 

The equation (I76p can be easily solved by considering the Maclaurin Series 
(Taylor Series about zero) of the function log(l-l-x) (as studied by Mercator 



Ci 
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as early as 1668) and evaluating for x = — ^ 



oo (_l)k' 
log(l + x) = ^^-^X 
k'=l 



2' 

k' 



^ ^ k'=i 

= -log(2). (78) 



Rearranging ([76]) and letting k = k' + 1, 

^1 = 



N ^ [k- l)2^+k 



MA 
MA 



k=2 

oo 



k'=l 



4Ar l°g(2). (79) 

For nodes with degree greater than one, the solution of (j77p requires a simple 
proof by induction. Consider the function Q{i') defined for positive integer 



i' as 



oo n'+k'\ 
fc'=0 

where (* ) denotes the conventional binomial coefficients. We can write 
Q{i' + 1) with similar ease: 

fc'=0 

As such, 

i' + k' + l\ fi' + /c' 



2Qi^' + l)-Qi^') = Y.{(^1^') 



fc'=0 ^ ^ ^ ^ ^ ^ 

2^ Ifc'-ijF^- (^^) 



k'=l 

A quick substitution of k" = k' — 1 leads to 

'i' + k" + 1\ 1 



A:"=0 ^ ^ 



2A:"+j'+l 



= Q(z' + 1) 

Q{i' + 1) = Qii'), 

Q(l) = 2 

= Q(«') Vi'. (83) 
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Rearranging ([771) and using some simple substitutions, i' = i — 2 and k' = 
k — 1 we can derive the following 



MA^^ {i + k -3)1 



(i-l)!(A:-l)!2^+'^ 



Ni 

MA ^ {i + k -3)1 



Ni{i -1)^ {i- 2)\{k - l)!2^+'= 

k=l 

MA - c'tn 

Ni{i - 1) ^ 2*'+^'+3 
^ ^ k'=0 

MA 1 



AN i{i - 1) 



(84) 



We can normalise this degree distribution for the network without the two 
node components (the 1 <-> 1 links): 



A 



Cl 



AN 



M(l + log(2)) ' 

log(2) 
1 + log(2) ' 
1 

(l + log(2))i(i-l) 



i > 1. (85) 



Although difficult to compare to simulations, we can populate a link-space 
matrix numerically using (I74p as illustrated in Figure [TSl With this, we can 
compare the degree distributions using ([75|) and those of as shown in 
Figure UHl 



7.2 Random Node Removal (RNR) 

In a similar manner to Section 17.11 we will now discuss the possibility of 
creating such a steady state via a process of removing nodes (and all of 
their links) from an existing network at the rate of w nodes per timestep. 
Clearly, removing some node which has a link to an {i + l)-degree node 
which in turn has a link to a j-degree node can increase the number of i <^ j 
links in the system. The other processes which can increase or decrease the 
number of links from i degree nodes to j degree nodes can be similarly easily 
explained. We consider that we select a node of some degree k for removal 
with some probability kernel Qk (as in the growing algorithms of Section 
[S]). The master equation for such a process can thus be written for general 
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Figure 19: Comparison of the degree distributions from the numerically 
populated link-space matrix and the analytic results of ()85p for the Random 
Link Removal model. The two node components have been neglected in 
both the numerical matrix population and analytically derived expression. 



node selection kernel: 

@k{t) f iLi+i,j{t) jLi,j+i{t) ] 

^Xkit)\ '^•^^^ iX.it) ^ ^^'^^ jX,{t) / ^ ^ 

We now select a kernel for selecting the node to be removed, namely 
the random kernel although the approach is general to any node selection 
procedure. For this process Qk = as a node is selected purely at random. 
The steady state assumptions must be clarified slightly. As before, 

Lij(t) = lijM{t). (87) 

However, because we can remove more than one link (through removing a 
high degree node for example) we must approximate for M{t). Using the 
random removal kernel, we can assume that on average, the selected node 
will have degree {k){t) = and we can use this to calculate the number 
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Figure 20: The mean degree of the neighbours of nodes of degree i for the 
random hnk and random node removal algorithms. 



of remaining links in the network. 



Lij{t) = lij{Mo-w{k){t)) 
, 2wM(t) , 

dLij _ 2wM{t) 

« Lij{t + l)-Lij{t). 



Making use of the link-space identities and (jSSp . the master equation 
()86p can be written in simple form: 

i + j -2 ^ ' 

Clearly, this is identical to (j70p for the random link removal model of Sec- 
tion 17.11 and consequently, the analysis of the degree distribution will be the 
same too. By considering the average degree of the neighbours of nodes 
of degree i, denoted {knn)i and making use of (f58]l in Section [6] we can see 
that both the random link removal and the random node removal algorithms 
generate highly assortative networks as evidenced in Figure [501 
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Figure 21: The Growing Network with Redirection. A random initial node 
is selected. The new node hnks to (and points at) this node with probability 
a or else is redirected to and attaches to the ancestor of the node. 

8 The Growing Network with Redirection model 
(GNR) 

In [30] , Krapivsky and Redner suggested a model of network growth without 
prior global knowledge using directed links. Within the same paper, they 
also provide analysis for the evolution of the joint degree distribution of 
simpler, directed growing networks (GN), which is similar to the link-space 
analysis outlined above. The GNR algorithm is implemented as follows: 

1. Pick a node n within the existing network at random. 

2. With probability a make a directed link pointing to that node or 

3. Else, pick the ancestor node of k (the node which its out-link is directed 
to) and make a directed link pointing to that node. 

We note that because each new node is attached to the existing network 
with only one link, each node has only one out link. When redirection occurs, 
the destination is predetermined. The degree distribution for this network 
is easily obtained. Consider some node of total (in and out) degree i labeled 
A. Its out-degree is 1 so its in-degree is simply i — 1. Thus, it must have 
i — 1 'daughters'. If any of these i — 1 daughter nodes is selected initially 
in the GNR algorithm and redirection occurs, then the new node will link 
to and point at A. It is straightforward to construct the evolution of the 
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node-space for the total degree of nodes within the system. The attachment 
probabihty kernel for the new node linking to any node of degree i within 
the existing network can be written as 



The first term on the right hand side corresponds to initially selecting a 
node of degree i — 1 and linking to it. The second term reflects redirection 
from any of the daughters of any of the degree i nodes. The static equation 
for the steady state degree distribution becomes 



For a = 0.5, the familiar recurrence relation of preferential attachment is 
retrieved and the distribution of total degree of nodes will subsequently be 
the same too. 



9 The External Nearest Neighbour Model (XNN) 



In this section we introduce a model that makes use of only local information 
about node degrees as microscopic mechanisms requiring global information 
are often unrealistic for many real- world networks ^45j . It therefore provides 
insight into possible alternative microscopic mechanisms for a range of bio- 
logical and social networks. The link-space formalism allows us to identify 
the transition point at lower power-law exponent degree distributions switch 
to higher exponent power-law distributions with respect to the BA preferen- 
tial attachment model. While similar local algorithms have been proposed 
in the literature \24:\ HQ] , the strength of the approach followed here is the 
ability to describe the inherent degree-degree correlations [181 ES EH EH] . 

It is well known that a mixture of random and preferential attachment 
in a growth algorithm can produce power-law degree distributions with ex- 
ponents greater than three, 7 € [3, 00). It has often been assumed that 
a one step random walk replicates linear preferential attachment [401 145) . 
This is not true. A one step random walk is in fact more biased towards 
high degree nodes than preferential attachment (as discussed in Section H]) 
as can be easily seen by performing the procedure on a simple hub and spoke 
network. In this case, the probability of arriving at the hub tends to one 
for increasingly large networks rather than a half as would be appropriate 
for preferential attachment. We can use this bias to generate networks with 
degree distributions that are overskewed (lower power-law exponent) than 
the preferential attachment model. A mixture of this approach with random 
attachment results in a simple model that can span a wide range of degree 
distributions. This one-parameter network growth model, in which we sim- 
ply attach a single node at each timestep, does not require prior knowledge 
of the existing network structure. 




(90) 




(91) 
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Figure 22: External Nearest Neighbour: some examples of the graphs gen- 
erated for 300 nodes. Left is a = 1 , center is a = 0.2 and right is a = 0. 
There is no spatial relevance in these plots which are drawn with pajek [H]. 




degree / degree / degree / 



Figure 23: The (cumulative) degree distributions for the evolving networks 
grown to 10^ nodes, ensemble averaged over 100 networks per parameter 
value. These have been initiated from a 2 node, 1 link component. 



Explicitly, the External Nearest Neighbour algorithm proceeds as fol- 
lows: 

1. Pick a node k within the existing network at random. 

2. With probability a make a link to that node or 

3. Else, pick any of the neighbours of k at random and link to that node. 

Hence this algorithm resembles an object or 'agent' making a short random- 
walk. This is very similar to the Growing Network with Redirection model 
discussed in Section [8] except that the model here employs undirected links 
which necessitates a random (as opposed to deterministic) walk. Clearly, 
this algorithm generates trees, some examples of which are illustrated in 
Figure [221 

The resulting distributions of vertex degree from implementing this growth 
algorithm are shown in Figure [23l These are ensemble averages over 100 
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networks per parameter value. Interestingly, a = yields a network that 
is dominated by hubs while a = 1 yields the random attachment network. 
Intermediate values of a yield networks which are neither too ordered nor 
too disordered. For o « 0.2, the algorithm generates networks whose degree- 
distribution closely resembles the BA preferential-attachment network. 

Our analysis of the algorithm starts by establishing the attachment prob- 
ability Oj, which in turn requires properly resolving the one-step random 
walk. The link-space formalism provides us an expression for the probability 
P/ associated with performing a random walk of length one and arriving at 
any node of degree i. 

Pi = PlPl~.^ + P2P2~.i + P2P2-.^ + ^3^3-.^ + 

Xi{t)L^,i{t) X2{t)L2,i{t) X^{t)L3,i{t) 
N{t)lXi{t) ^ N{t)2X2{t) N{t)3X3{t) 

N{t) ^ ~' 
We can reconcile this with the alternative expression 



(92) 



„, iXjit) , 1 > , , 



where the average is performed over the neighbours of nodes of degree i. 
With proper normalisation, 



_ 1 \ - Lj^kjt) 



iXi{t) ^ k 



(94) 



which clearly yields the same results. Note that this quantity does not 
replicate preferential attachment, in contrast to what is commonly thought 
|401 145j. We shall perform this process explicitly for the example (A^ = 11, 
M = 10) network of Figure [TT] with the link-space matrix 

/ 1 2 4 \ 

10 1 



2 1 
\ 4 1 1 2 / 



(95) 



Let us consider the probability of arriving at any node of degree 1 by per- 
forming a random walk of length one from a randomly chosen initial vertex. 



Pi — PlPl-tl + P2P2^1 + P2P2^1 + -^3^3^! + P^Pi^l, 

11 12 1 /3 1\ 
= ° + 112 + 113 + Tlli + l)' 
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This is clearly consistent with the analysis of (I92|) . 

We can now begin to formulate the master equations for this model. Defining 

Pi as 

_ k k 

= (97) 

l^k ''i,k 

the attachment kernel for the External Nearest Neighbour algorithm be- 
comes 

= aci + (l-a)AiQ. (98) 

Substituting (|98]l into (fT5]l we obtain for the steady state node degree 

Ci_i(a + (1 - a)A-i(i - 1)) 

l + a + (l-a)Ai ■ ^ ' 

Substituting into ([TO]) we get 

/i^i,j(a + (1 - a)A-i(z - 1)) + hj-i{a + (1 - a)/3j-i(i - 1)) . . 

~ l + 2a + {l-a)m+jf3,) ''^ ^ 

hj-ija + (1 - a)f3j-i{j - 1)) + Cj-i(o + (1 - a)/?j-i(j - 1)) . 

l + 2a + {l-a){Pi+j/3j) ^> ' 
^1,1 = 0. (100) 

The non-linear terms resulting from /5 mean that a complete analytical so- 
lution for lij is difficult. We leave this as a future challenge. However, the 
formalism can be implemented in its non-stationary form numerically (it- 
eratively) with reasonable efficiency. As described in Section [Sj the master 
equations for the network growth in the link-space are given for i,j > 1 as 

(L.,(t + 1)) = L,,(t) + Q-f\^-(^^Q--^(^)^----^(^) 



Xi-i{t) Xj_i{t) 
Qi{t)Lij{t) _ Qj{t)Lij{t) 



(101) 



and for i = l,j > 1 as 

(Lij(t + i)) = Lij(t) + ej_i + 



ej„i(t)Lij_i(t) 



9i(t)Li,j(t) 9,(t)Li,,(t) 

Xi{t) x,{t) ■ 



(102) 
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Computationally, it is useful to rewrite these equations (jlOip and ()102|) in 
terms of matrix operations. We define the following matrices and operators: 
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... \ 
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I = 


the identity matrix, 
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lit) 



Hadamard elementwise multiplication such that 
for e = /(g) 5, Ci = figi, 
Elementwise divide such that for e = f g, 



fi/9i 


e(t) 0X{t), 
( 1 \ 

2 
3 

V '■■ } 



for gi / 0, 
otherwise, 



(104) 
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/ 1 \ 
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1 

V '■■ J 

10 p 

( 1 \ 
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2 
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3 

V '■■ ) 



diag(x) 



Xl 










































X4 



(105) 



Armed with these useful building blocks, we can represent the link-space 
master equations, (fT^ as 

(L(t + 1)) = ((F-I)diag(7(t)) + l)L(t) + L(t)diag(7(t))(E-I) 

+ Fdiag(G(t))K + K^diag(e(i))E. (106) 

Note that we have not lost generality in specifying the attachment kernel 
0(t) = 7(t) ® X{t). Although ()106p looks somewhat intractable, the terms 
can be easily explained. Premultiplying by F has the effect of moving all the 
elements down one place. Postmultiplying by E moves the elements right. 
The last two terms represent the Q term in (|102p . 

We can now tailor for the three systems. For random attachment we 
have 



m = 
lit) = 

For preferential attachment we have 



1 



N{t) 
1 

W) 



m 

lit) 



1 



2(iV(t) 
1 



1 



Mt), 
1. 

-p(g)X{t), 



(107) 



(108) 



2(iV(t)-l)^- 

Things start to get a little trickier for the external nearest neighbour algo- 
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rithm yielding 



Recall from (ETI) that 



m = (L(%)0(L(t)l). (110) 

Actual computation is somewhat more straight-forward than the mathe- 
matics suggests and is based upon the assumption that we can interchange 
(L(t -|- 1)) with L(t -|- 1) to enable a recursive process for the evolution of 
the link-space matrix. Whilst this would not be strictly apppropriate to 
model the evolution of a single realisation of the algorithm or even an en- 
semble average (each realisation would have a path dependent evolution), if 
a steady state is reached (albeit approximately) then this will be a steady 
state of the algorithm. This is a similar assumption to that made in the 
node-space master equation approach to non-equilibrium networks in Sec- 
tion O We seed with Li^i(l) = 2 reflecting two connected nodes (such 
that N{t) = t + \ and total links, M = t). To account for the finite size of 
the matrices involved, the link-space is normalised to 2 {N(t) — 1) at each 
iteration although in practice, the very edges of the matrix where numbers 
might fall off have very small values. We retrieve the degree distribution c 
from the link-space matrix. 



Xiit) 



X{t) = ^0L(t)l, 

At) = (in) 

Comparison of the matrices L(i -|- 1) and L(t) or the vectors c{t + 1) and 
c(t) gives an idea of the proximity to the steady state. The results can be 
seen in Figure [Ml and appear (qualitatively) comparable to the numerical 
numerical simulations of Figure [531 

We can now use the link-space formalism to deduce the parameter value 
at which our algorithm approximates the BA degree distribution. At this 
value a = Cc, the node-degree distribution goes from higher to lower power- 
law exponent in relation to the BA model. For this parameter value, the 
attachment probability to nodes of various degrees is approximately equal 



51 




Figure 24: Degree distributions generated using the link-space analysis of 
our one-step algorithm. Dashed line: results for the Barabasi-Albert (BA) 
[1] preferential-attachment algorithm. Our one-step algorithm ressembles 
the BA results with parameter value a = 0.25 (see inset). 



for both our mixture algorithm and the BA model. Recall from Section [5.21 
and (IMl) 

^ ac + (l-ae)AL (112) 

For large i we have 

- 1-^. (113) 

We can use the exact solution of the link-space for the preferential attach- 
ment algorithm (or read off the graph in Figure [25]) to infer /3j in the high 
i limit. Using the first two terms of (j52p we obtain /? ~ 0.66 (the graph 
asymptotes to 0.65) and substituting into ()113p gives ac ~ 0.25. It is im- 
portant to note that /3j is a function of i (as is evident in Figure [25]) which 
in turn is non-trivially dependent on Oc. Consequently, it is unlikely for 
(|112p to hold for all values of i. Whilst the mixture model approximates the 
preferential attachment distribution at this value, the actual distributions 
are not identical. 

10 Concluding remarks and discussion 

In conclusion, we have developed a new formalism which accurately accounts 
for the node-node linkage correlations in networks. We have employed the 
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Figure 25: The value of /3j for the preferential attachment algorithm. 



formalism to produce analytic solutions to the link-space correlation matrix 
for the random attachment and BA models of network growth as well as 
the classical random graph. We have also derived the form of a perfectly 
non-assortative network for arbitrary degree distribution and illustrated the 
possiblility of a steady-state degree distribution and link-space for decaying 
networks. 

We have used the formalism to dispel the myth that a one-step random 
walk from random initial position in an arbitrary network is the same as 
selecting a node with probability proportional to its degree. By definition, 
only a perfectly non-assortative network, as derived in Section 16.11 would 
satisfy this condition. 

The link-space formalism has allowed us to describe a simple one-parameter 
network growth algorithm which is able to reproduce a wide variety of de- 
gree distributions without any global information about node degrees. There 
are clearly many alternative yet similar algorithms which could be designed 
which might also be analysed with the link-space formalism. These might 
provide insight into certain real world networks which not only exhibit simi- 
lar scaling behaviour but might also have similar constraints in their micro- 
scopic growth rules. 
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